2882 



IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 51, NO. 11, NOVEMBER 2003 



Pilot-Assisted Estimation of MIMO Fading Channel 
Response and Achievable Data Rates 

Dragan Samardzija and Narayan Mandayam, Senior Member, IEEE 



Abstract — We analyze the effects of pilot-assisted channel esti- 
mation on achievable data rates (lower bound on information ca- 
pacity) over a frequency flat time-varying channel. Under a block- 
fading channel model, the effects of the estimation error are eval- 
uated in the case of the estimates being available at the receiver 
only (open loop) and in the case when the estimates are fed back 
to the transmitter allowing water pouring transmitter optimiza- 
tion (closed loop). Using a characterization of the effective noise 
due to estimation error, we analyze the achievable rates as a func- 
tion of the power allocated to the pilot, the channel coherence time, 
the background noise level, as well as the number of transmit and 
receive antennas. The analysis presented here can be used to op- 
timally allocate pilot power for various system and channel oper- 
ating conditions, and to also determine the effectiveness of closed 
loop feedback. 

Index Terms — Ergodic capacity, estimation, MIMO, water 
pouring. 

I. Introduction 

FADING channels are an important element of any wireless 
propagation environment [1]. Different aspects of fading 
channels have been studied and publicized. It has been recog- 
nized that the inherent temporal and spatial variations of wire- 
less channels impose stringent demand on design of a communi- 
cation system to allow it approach the data rates that are achiev- 
able in, for example, wire-line systems. A number of different 
solutions exploit variations in wireless channels. For example, a 
transmitter optimization scheme (using power control), known 
as the water pouring algorithm, maximizes the capacity for the 
constrained average transmit power [2] (see also [3]). In addi- 
tion to power control, recent applications of variable coding rate 
and modulation formats illustrate a wide range of resource allo- 
cation techniques used to exploit and combat effects of fading 
channels in multiuser wireless systems [4]. An extensive re- 
view of the information theoretical aspects of communications 
in fading channels is given in [5]. Furthermore, modulation and 
channel coding for fading channels is also being studied (see [5] 
and references therein). 

Multiple-transmit multiple-receive antenna systems represent 
an implementation of the MIMO concept in wireless communi- 
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cations [6]. This particular multiple antenna architecture pro- 
vides high-capacity wireless communications in rich scattering 
environments. It has been shown that the theoretical capacity 
(approximately) increases linearly as the number of antennas is 
increased [6], [7]. This and related results point to the impor- 
tance of understanding all aspects of MIMO wireless systems. 
For example, the studies regarding propagation [8]— [10], detec- 
tion [11], [12], space-time coding, and implementation aspects 
[13]— [15] are well publicized. 

Next-generation wireless systems and standards are supposed 
to operate over wireless channels whose variations are faster 
and/or further pronounced. For example, using higher carrier 
frequencies (e.g., 5 GHz for 802.11a) results in smaller scale 
of spatial variations of the electro-magnetic field. In addition, 
compared with SISO channels, MIMO channels have greater 
number of parameters which which a receiver and/or transmitter 
has to operate, consequently pronouncing the channel varia- 
tions. In addition, there has been a perpetual need for supporting 
higher mobility within wireless networks. These are just a few 
motivations for studying the implications of channel variations 
on achievable data rates in wireless systems. 

In this paper, we analyze how the estimation error of the 
channel response affects the performance of a MIMO wireless 
system. Considering the practical importance of single-input 
single-output (SISO) systems, we analyze them as a subset 
of MIMO systems. Considering terminology in literature (see 
[5] and references therein), the channel response estimate 
corresponds to channel state information (CSI). We assume 
a frequency-flat time-varying wireless channel with additive 
white Gaussian noise (AWGN). More precisely, a quasistatic 
block-fading channel model is used. Furthermore, the temporal 
variations of the channel are characterized by the correlation 
between successive channel blocks. The above system may 
also correspond to one subchannel (i.e., carrier) of an OFDM 
wireless system [16]. We consider two pilot (training) arrange- 
ment schemes in this paper. The first scheme uses a single pilot 
symbol per block with the different power than the data symbol 
power. The second scheme uses more than one pilot symbol 
per block, whose power is the same as the data symbol power. 
For the given pilot schemes, in both cases, maximum-likelihood 
(ML) estimation of the channel response is considered [17]. In 
the MIMO case, the orthogonality between the pilots assigned 
to different transmit antennas is assumed. The effects of the 
estimation error are evaluated in the case of the estimates 
being available at the receiver only and in the case when the 
estimates are fed back to the transmitter allowing water pouring 
optimization. The presented analysis may be viewed as a study 
of mismatched receiver and transmitter algorithms in MIMO 
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systems. The analysis connects results of Inform. Theory 
(see [18], [19], and references therein) with practical wireless 
communication systems (employing pilot-assisted channel 
estimation) and generalizing it to MIMO systems. Previously 
published studies on MIMO channel estimation, and its effects 
include [20] and [21]. An elaborate informationtheoretical 
study analyzing different training schemes, and optimizing 
their parameters to maximize the open-loop MIMO capacity 
lower bounds, is also presented in [22]. We will highlight the 
similarities and differences of the work presented here to that 
in [22] in Sections II-VII of this paper. We believe that the 
results presented here are directly applicable to current and 
next-generation wireless systems [13]— [15], [23]. Furthermore, 
the results may be used as baseline benchmarks for perfor- 
mance evaluation of more advanced estimation and transmitter 
optimization schemes, such as that which is anticipated in 
future systems. 

II. System Model 

In the following, we present a MIMO communication system 
that consists on M transmit and N receive antennas (denoted as 
a M X N system). At the receiver, we assume sampling with 
the period T smp = 1/B, where B is the signal bandwidth, thus 
preserving the sufficient statistics. The received signal is a spa- 
tial vector y 

y(fc) =H(fc)x(fc) + n(fc),y(fc) G C N ,x(k) G C M 

n(k)eC N ,n(k)eC NxM (i) 

where x(fc) = [gi(k) ■ ■ ■ gM(k)] T is the transmitted vector, 
n(k) = [n^k) ■ • ■n,N(k)] T is the AWGN vector with 
(E [n(k)n(k) H ) = N 0 Inxn), and H(fc) is the MIMO 
channel response matrix, all corresponding to the time instance 
k. We assign index rn = 1,...,M to denote the transmit 
antennas and index n = l,...,N to denote the receive 
antennas. Thus, h nm (k) is the nth row and mth column 
element of the matrix H(fc). Note that it corresponds to a SISO 
channel response between the transmit antenna m and the 
receive antenna n. g m (k) is the transmitted signal from the mth 
transmit antenna The nth component of the received spatial 
vector y(k) = [yi(k) ■ ■ -yN(k)] T (i.e., signal at the receive 
antenna n) is 

M 

Vn{k) = ^ h nm (k)g m (k) + n n (k). (2) 

m=l 

To perform estimation of the channel response H(fc), the re- 
ceiver uses a pilot (training) signal that is a part of the trans- 
mitted data. The pilot is sent periodically (every K sample pe- 
riods). We consider the transmitted signal to be comprised of 
two parts: One is the data bearing signal, and the other is the 
pilot signal. Within the pilot period consisting of K symbols, 
L symbols (i.e., signal dimensions) are allocated to the pilot per 
transmit antenna. As a common practical solution (see [13]— [15] 
and [24]), we assume that the pilot signals assigned to the dif- 
ferent transmit antennas are mutually orthogonal. For more de- 
tails on signal design for multiple transmit antenna systems, 



see also [25] and [26]. This assumption requires that K > 
LM. Consequently, we define a if -dimensional temporal vector 
g™ = [fl-m(l) • • • g m (K )] T , whose fcth component is g m (k) [in 
(2)], as 

K-LM L 

e: - V a? <fi s^ + S^a p - £ s p - (3) 

i=l j=l 
V v ' v v ' 

Data pilot 

In the above, the first sum is the information, i.e., data bearing 
signal and the second corresponds to the pilot signal, corre- 
sponding to the transmit antenna m. Superscripts "d" and "p" 
denote values assigned to the data and pilot, respectively. df m 
is the unit-variance circularly symmetric complex data symbol. 
The pilot symbols (d^ m , j = 1, . . . , L) are predefined and 
known at the receiver. Without loss of generality, we assume 
that \dP jm \ = 1. We also assume that the amplitudes are 
af m = A and a P m = Ap and that they are known at the 
receiver. Further, the amplitudes are related as Ap = aA. 
Note that the amplitudes are identical across the transmit 
antennas (because we assumed that the transmit power is 
equally distributed across them). 

Furthermore, sf = [sf(l) ■ ■ ■ sf(K)] T , (i = 1, . . . , (K - 

LM)), s p m = [s p m (l) ■ ■ ■ s p m (K)] T (j = 1, . . . , L, and m = 
1, . . . M) are waveforms, which are denoted as temporal sig- 
natures. The temporal signatures are mutually orthogonal. For 
example, sf (or s p m ) could be a canonical waveform such as a 
TDMA-like waveform, where sf (or s p m ) is the unit-pulse at the 
time instance i. Alternately, sf (or s p m ) could also be a K-di- 
mensional CDMA sequence spanning all K sample intervals 
[23]. Note that the above model, while being general enough, 
is particularly suitable for MIMO implementations over CDMA 
systems (see [24]). 

As stated earlier, we assume that the pilot signals are orthog- 
onal between the transmit antennas. The indexing and summa- 
tion limits in (3) conform to this assumption, i.e, temporal signa- 
tures s p m (j = 1, . . . , L) are uniquely assigned to the transmit 
antenna m. In other words, transmit antenna m must not use 
the temporal signatures that are assigned as pilots to other an- 
tennas and assigned to data, which is consequently lowering the 
achievable data rates (will be revisited in Sections III- VII). Un- 
like the pilot temporal signatures, the data bearing temporal sig- 
natures sf (i = 1, . . . , (K—LM)) are reused across the transmit 
antennas, which is an inherent property of MIMO systems, po- 
tentially resulting in high achievable data rates. It is interesting 
to note that the assumptions regarding the orthogonality be- 
tween the pilots (motivated by practical considerations) are also 
shown to be optimal in [22], maximizing the open-loop capacity 
lower bound. Similar conclusions are drawn in [20] and [26]. We 
rewrite the received spatial vector in (1) as 

y(fe) =H(fe)(d(fe) + p(fe)) + n(fe) 

d(fc) GC M , p(fc) G C M (4) 

where d(k) is the information, i.e., data bearing signal and p(k) 
is the pilot portion of the transmitted spatial signal, at the time 
instance k. The mth component of the data vector d(k) = 



2884 



IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 51, NO. 11, NOVEMBER 2003 



[di(k) ■ ■ ■ dM(k)] T (i.e., data signal at the transmit antenna m) 



is 



K-LM 



d m (k)= < m dL4(k). 



(5) 



i=l 



The mth component of the pilot vector p(k) = 
[pi(k) ■ ■ ■ pM(k)] T (i.e., pilot signal at the transmit antenna 

to) is 



(6) 



Let us now describe the assumed properties of the MIMO 
channel H(fc). The channel coherence time is assumed to be 
greater or equal to KT smp . This assumption approximates the 
channel to be constant over at least K samples (h nm (k) « h nm , 
for A; = 1, . . . , K , for all m and n), i.e., approximately con- 
stant during the pilot period. In the literature, channels with the 
above property are known as block-fading channels [16]. Fur- 
thermore, we assume that the elements of H are independent 
identically distributed (iid) random variables, corresponding to 
highly scattering channels. In general, the MIMO propagation 
measurements and modeling have shown that the elements of 
H are correlated (i.e., not independent) [8]— [10]. The effects 
of correlation on the capacity of MIMO systems is studied in 
[27]. We assume that independence is a common practice be- 
cause the information about correlation is usually not available 
at the receiver and/or the correlation is time varying (not sta- 
tionary) and hard to estimate. Based on the above, the received 
temporal vector at the receiver n, whose kth component is y n (k) 
(in (2)), is 

M 

?n = • • • Vn(K)Y = ^ hnmgm + n„, T n <E C K (7) 

m=l 

where n n = [n n (l) ■ ■ ■ n n (K)] T , and £[n„n^] = N 0 I KxK . 

Note that when applying a different number of transmit an- 
tennas, the total average transmitted power must stay the same, 
i.e., conserved. This is a common assumption in MIMO sys- 
tems [6], [7]. In addition, the power is equally distributed across 
the transmit antennas. The average transmit power (from all 
transmit antennas) is 

f K-LM „ L „\ 



P a v =M- 



i=l 



3=1 



K 



Thus 



--M 



A = 



((K - LM) + La 2 ) A 2 



K 



I K_ 

((K - LM) 



p 

a 2 L) H 



(8) 



(9) 



As seen from the above, we assume that the total average trans- 
mitted energy (within the pilot period) is the same but differ- 
ently distributed between the data bearing portion of the signal 
and the pilot. Consequently, we observe the performance of the 
system with respect to the amount of transmitted energy that is 



allocated to the pilot (percentage wise). This percentage is de- 
noted as n and is given as 



La 2 



(K - LM) + La 2 



100 [%]. 



(10) 



As stated earlier, in this study, we consider two different pilot 
arrangements. 

1) L = 1 and A P ^ A. The amplitude is 



K 



((K — M) + a 2 ) M 



(11) 



2) 



In the remainder of the paper, the above pilot arrangement 
is referred to as case 1. For example, in SISO systems, 
the above pilot arrangement is applied in CDMA wireless 
systems (e.g., IS-95 and WCDMA [23]). In MIMO sys- 
tems, it is applied in narrowband MIMO implementations 
described in [13]— [15]. It is also applied in a wideband 
MIMO implementation based on 3-G WCDMA [24]. 
Note that under certain assumptions to be pointed out 
in Section III, the above pilot arrangement scheme is 
equivalent to the scheme in [22] 
L > 1 and Ap = A (a = 1). The amplitude is 



K 



(K - L(M - 1)) M 



(12) 



In the remainder of the paper, the above pilot arrangement 
is referred to as case 2. Note that the above pilot arrange- 
ment is frequently used in SISO systems, e.g., wire-line 
modems [28] and some wireless standards (e.g., IS- 136 
and GSM [16]). This arrangement is typically not used in 
MIMO systems. 
In Section VI, we will analyze the performance of these two 
cases because they are widely applied in different communica- 
tion systems. 

III. Estimation of Channel Response 

Due to the orthogonality of the pilots and assumption that 
the elements of H are iid, it can be shown that to obtain the 
maximum likelihood estimate of H, it is sufficient to estimate 
hnm (for Tn = 1, . . . , M, n = 1,...,N) independently. 1 This 
is identical to estimating a SISO channel response between the 
transmit antenna m and receive antenna n. The estimation is 
based on averaging the projections of the received signal on 
1 (for j — 1, . . . , L and m — 1, . . . , M) as 



d p - s 

jiii j m 



1 L H 
m = LA (^jm^m) V n 

i L 

= Km + LA^^^ 



s p - 



(13) 



'Based on the above assumptions, it can be shown that in this particular case, 
the ML estimate is equal to the LMMSE estimate considered in [22]. 
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where h nm denotes the estimate of the channel response h nm . It 
can be shown that for a frequency-flat AWGN channel, given the 
pilot signal and the assumed properties of H, (13) is the max- 
imum-likelihood estimate of the channel response h nm [17]. 
The estimation error is 

L 



LA 



Note that n e nra corresponds to a Gaussian random variable with 
distribution M (0, N 0 / (L(aA) 2 )). Thus, the channel matrix 
estimate H is 



H = H + H e 



(15) 



where H e is the estimation error. Each component of the error 
matrix H e is an iid random variable n^ m given in (14) (where 
n^m is the nth row and mth column element of H e ). 

Having the channel response estimated, the estimate of the 
transmitted data that is associated with the temporal signature 
sf is obtained starting from the following statistics: 



A 



(16) 



where the amplitude A is assumed to be known at the receiver. 
z n i corresponds to the nth component of the vector 



[Zli 



■ z Ni ] T = Hdi + -n;, i = l,...,K- LM (17) 



where the mth component of = [d^ • ■ • df M ] T is df m (data 
transmitted from the antenna m and assigned to the temporal 
signature sf). Further, E [n^nf*] = NoIn x n- It can be shown 
that Zi is a sufficient statistic for detecting the transmitted data. 
Using Zi, a MIMO receiver would perform detection of the 
transmitted data. Detection of the spatially multiplexed data, 
which is is not a focus of this paper, can be done, for example, 
using the VBLAST algorithm [11], [24], 

As a common practice, the detection procedure assumes that 
the channel response is perfectly estimated and that H corre- 
sponds to the true channel response. Let us rewrite the expres- 
sion in (17) as 



Zi =(H + H e )di + — iii - H e d; 



=Hdi + ( -n; - H e d 



(18) 



The effective noise in the detection procedure (as a spatial 
vector) is 



n; 



-n,- - H fi d, 



(19) 



For the given H, the covariance matrix of the effective noise 
vector is 



T = T(A) = E g [fUBf] = ^I + J5 g 



A 2 ' 



H e H, 



H 



(20) 

and it is a function of the amplitude A. As stated earlier, H e 
is a matrix of iid Gaussian random variables with distribution 
Af(0,N 0 / (L(aA) 2 )). 



It can be shown that for a Rayleigh channel, where the en- 
tries of H are iid Gaussian random variables with distribution 
Af(0, 1), the above covariance matrix is 

„ An, .. 1 . ( 1 



A 2 



I + M- 



1 + 



L(aA) 2 ■ 
No 



1 + 



L(aA) 2 
N 0 



xHH 



H 



(21) 



IV. Estimates Available to Receiver: 
Open-Loop Capacity 

Assuming that the channel response estimate is available to 
the receiver only, we determine the lower bound for the open- 
loop ergodic capacity as follows: 



C> R = 



K-LM 



K 



£g [log 2 det(] 



MxM 



+ HH^T" 



')] 



(22) 

The term (K — LM)/K is introduced because L temporal sig- 
nature per each transmit antenna are allocated to the pilot. In ad- 
dition, the random process H has to be stationary and ergodic 
(this is a common requirement for fading channel and ergodic 
capacity [5], [29]). We assume that the channel coding will span 
across great number of channel blocks (i.e., we use the well 
known infinite channel coding time horizon, which are required 
to achieve error-free data transmission with rates approaching 
capacity [30]). 

In the above expression, equality holds if the effective noise 
[given in (19)] is AWGN with respect to the transmitted signal. 
If the effective noise is not AWGN, then the above rates repre- 
sent the worst-case scenario, i.e., the lower bound [22], [31]. In 
achieving the above rates, the receiver assumes that the effective 
noise is interference (which is independent of the transmitted 
data) with a Gaussian distribution and spatial covariance matrix 
T. In addition, in the above expression, R represents an achiev- 
able rate for reliable transmission (error-free) for the specific 
estimation procedure assumed. Knowing the channel response 
perfectly or using a better channel estimation scheme (e.g., de- 
cision driven schemes) may result in higher achievable rates. 

Note that the capacity lower bounds for MIMO channel esti- 
mation independently derived in [22], assuming the time mul- 
tiplexing of data and pilot (i.e., training) symbols. The authors 
also present analytical results on the optimal properties required 
of the training sequences, their duration, and power. The signal 
model presented here is more general than that, and the dis- 
tinguishing feature of this paper is the mismatched closed-loop 
transmission analysis presented in the Section V. 

In the following, we compare the above result in (22) with 
an information theoretical result presented in [5, p. 2641, ex- 
pression (3.3.55)]. The result is presented for the conventional 
SISO case, introducing a capacity lower bound for mismatched 
decoding as 



C > R* = Et 



log 2 1 + 



h 2 P 



h \h (\h-h\ 2 )P + N () 



E; 



(23) 

where h and h are the SISO channel response and its estimate, 
respectively. The above result is quite general, not specifying 
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the channel response estimation procedure. The bound in (22) is 
an extension of the information theoretical bound in (23), cap- 
turing the more practical pilot-assisted channel response esti- 
mation scheme and generalizing it to the MIMO case. Conse- 
quently, we have the following proposition. 

Proposition 1: For the SISO case (M = 1, N = 1), the rate 
R in (22) and R* in (23) are related as 

R=i nr Lir >** p= (K-?) + «>L p » (24) 

where h is obtained using the pilot-assisted estimation. 

V. Estimates Available to Transmitter and Receiver: 
Closed-Loop Capacity 

In MIMO systems, when the channel state H is perfectly 
known at the transmitter, to maximize the capacity (under con- 
strained transmit power), the transmitter performs optimization 
known as water pouring on eigen modes. For SISO systems, 
the water pouring algorithm is given in [2]. In practical com- 
munication systems, the channel state H has to be estimated 
at the receiver and then fed to the transmitter. In the case of a 
time-varying channel, this practical procedure results in noisy 
and delayed (temporally mismatched) estimates being available 
to the transmitter to perform the optimization. 

As stated earlier, the MIMO channel is time varying. Let 
Hi_i and Hi correspond to consecutive block-faded channel 
responses. In the following, the subscripts i and i — 1 on dif- 
ferent variables will indicate values corresponding to the block 
channel periods i and i — 1, respectively. The temporal charac- 
teristic of the channel is described using the correlation 

f ~ K 



(25) 



where Y = E[hi nm h* nm ], and hi nm is a stationary random 
process (for m = 1,...,M and n = 1,...,N, denoting 
transmit and receive antenna indices, respectively). We assume 
that the knowledge of the correlation k is not known at the 
receiver and the transmitter. Note that the above channel is 
modeled as a first-order discrete Markov process. 2 

Adopting a practical scenario, we assume that the receiver 
feeds back the estimate Hi_i. Because the ideal channel state 
Hi is not available at the transmitter, we assume that Hi_i is 
used instead to perform the water pouring transmitter optimiza- 
tion for the ith block. In other words, the transmitter is ignoring 
the fact that Hi / Hi_i. 

The water pouring optimization is performed as follows. 
First, the estimate is decomposed using singular value decom- 
position (SVD) as Hi_i = Ui-iSi-iV^i [32]. Then, if the 
data vector d(k) is to be transmitted (in (4)), the following 
linear transformation is performed at the transmitter: 



d(k) = Vi^Sidik) 



(26) 



where the matrix S; is a diagonal matrix whose elements Sijj 
(j = 1, . . . M) are determined by the water pouring algorithm 
per singular value of Hi_i, i.e., the diagonal element of £i_i 

2 Note that in the case of Jake's model, k = J 0 (2-Kf d T), where f d is the 
maximum Doppler frequency, and r is the time difference between /i(i_i)„ m 
and hi nrn . 



(which are denoted as <T(i-i)jj, j 
element of S,- is defined as 



, M). The diagonal 



70 

0. 



N 0 



l^-D^FA^ for l%-i)jjl No- >7o (27) 
otherwise 

where 70 is a cut-off value, and it depends on the channel fading 
statistics. It is selected such that the average transmit power 
stays the same P av [2]. Consequently, at the time instant k, the 
received spatial vector is 

y(fc) =H l V i _ 1 S i d(A;) + H iP (fc) + n(fc) 

=Gd(fc) + H lP (k) + n(fc) (28) 

and 



G — HiVi_iSi. 



(29) 



The water pouring optimization is applied on the data bearing 
portion of the signal d(k), whereas the pilot p(k) is not 
changed. The receiver knows that the transformation in (26) is 
performed at the transmitter. The receiver performs estimation 
of the channel response matrix as given in Section III, resulting 
in G = H^Vi-iSi and the error matrix G e = H ei Vi_iSi. In 
this case, the effective noise in (19) and its covariance matrix 
in (20) are modified accordingly, resulting in 



N 0l 



T^ = T^(A) = ^I + ^ |G 



G e G, 



H 



(30) 



In the above and following expressions, the superscript " WP" 
denotes water pouring. Note that the above application of the 
water pouring algorithm ger eigen mode is suboptimal, i.e., it 
is mismatched (because Hi_i is used instead of Hi). Conse- 
quently, the closed-loop system capacity is bounded as 



C 



WP 



>R WP = 



K-LM 
K 



log 2 det (Imxm+ 

gg h (r WP y 1 ) 



(31) 



Similar to the comments related to the result in (22), the random 
process G has to be stationary and ergodic. In addition, the 
channel coding will span across infinite number of channel 
blocks to achieve error-free data transmission approaching the 
above rates. Again, the equality holds if the effective noise is 
AWGN with respect to the transmitted signal and if not, then 
the above rates represent the worst-case scenario, i.e., the lower 
bound [31]. In achieving the above rates, the receiver assumes 
that the effective noise is interference, which is independent 
of the transmitted data with Gaussian distribution and spatial 
covariance matrix ~f wp . Knowing the channel response per- 
fectly or using a better channel estimation or prediction scheme 
may result in higher achievable rates. There has also been some 
recent work in [33] on closed-loop MIMO OFDM transmission 
over a parametric frequency selective channel model. 

VI. Examples and Numerical Results 

A. SISO Systems 

To illustrate the above analysis, we start with SISO systems. In 
the SISO case, all previously defined spatial vectors and related 
matrices are now single dimensional (e.g., di, H, H, and T are 
now scalars di, h, h, and v, respectively). In Fig. 1, we present 
the rate R in (22) as a function of the power allocated to the pilot 
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Fig. 1 . Achievable open loop rates versus power allocated to the pilot, SISO 
system, SNR= 4, 12, 20 dB, coherence time K = 10, Rayleigh channel. 
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Fig. 2. Capacity efficiency ratio versus channel coherence time (K = 10, 20, 
40, 100), SISO system, SNR= 4, 20 dB, Rayleigh channel. 



(10). In this example, a pilot period K is 10 and coincides with 
the coherence time. A frequency-flat Rayleigh fading channel is 
assumed. The results are shown for the pilot arrangements cor- 
responding to both cases 1 and 2. For the ideal knowledge of the 
channel response, we apply the ergodic capacity formula [5] . Re- 
garding the achievable rates, from the above results, we observe 
that case 1 is less sensitive to the pilot power allocation than case 
2 (i.e., in case 2, R drops faster if the allocated power is different 
from the one that results in the maximum value). Further, case 1 
achieves higher maximum achievable rates than case 2. 

For the given SNR, we define the capacity efficiency ratio rj 
as the ratio between the maximum rate R (with respect to the 
pilot power) and the ergodic capacity C mXn m the case of the 



ideal knowledge of the channel response, i.e., 



Vn 



max^ R 

Cm y rj. 



(32) 



The index m and n correspond to number of transmit and re- 
ceive antennas, respectively. In Fig. 2, we show that the capacity 




40 50 60 

Pilot power [%] 



Fig. 3. Achievable open-loop rates versus power allocated to the pilot, MIMO 
system, SNR= 12 dB, coherence time K = 40, Rayleigh channel. Solid line 
corresponds to a system with the channel response estimation and dashed line 
to the case of the ideal channel response knowledge. 
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Fig. 4. Capacity efficiency ratio versus channel coherence time (K = 10, 20, 
40, 100), MIMO system, SNR= 12 dB, Rayleigh channel. 

efficiency ratio 771x1 increases with the channel coherence time. 
From the above results, we conclude that case 1 is a more effi- 
cient scheme than case 2. 

B. MIMO Systems 

In Fig. 3, we present the rate R in (22) as a function of 
the power allocated to the pilot (10) for a different number 
of transmit and receive antennas. In this and the following 
numerical examples, we consider only the pilot arrangement 
case 1 (treating case 2 as impractical for MIMO systems). We 
observe the rates for the Rayleigh channel, which are SNR= 
12 dB and the channel coherence time length K = 40. Solid 
lines correspond to a system with the channel response estima- 
tion and dashed lines to a system with the ideal knowledge of 
the channel response. Further, in Fig. 4, we show the capacity 
efficiency ratio r\ for a different number of transmit and receive 
antennas versus different channel coherence time lengths. 
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Fig. 5. (a) Ergodic capacity versus SNR, MIMO system, ideal knowledge of 
the channel response, Rayleigh channel. Solid line corresponds to open-loop 
capacity and dashed line to closed-loop capacity (perfect temporal match 
Hi_! = H; is assumed), (b) CDF of capacity, MIMO system, SNR = 4 
dB, ideal knowledge of the channel response, Rayleigh channel. Solid line 
corresponds to open-loop capacity and dashed line to closed-loop capacity 
(perfect temporal match Hi_i = H; is assumed). 



We observe that as the number of transmit antennas increase, 
the sensitivity to the channel response estimation error is 
more pronounced (while keeping the same number of receive 
antennas). For example, for the same channel coherence time 
length, the capacity efficiency ratio of the 4x4 system is lower 
than that in the case of the 3 X 4 system. 

In Fig. 5(a), we present open-loop (solid lines) and closed- 
loop (dashed lines) ergodic capacities. Idealized conditions are 
assumed, i.e., the ideal knowledge of the channel response is 
available to the transmitter and receiver and perfect temporal 
match Hi_i = (for the water pouring optimization) is as- 
sumed. Comparing the closed- and open-loop capacity, we ob- 
serve that the gains are more pronounced for lower SNR (e.g, 
for 4 x 4 system at 0 dB, the gain of the closed-loop system is 
approximately 2 dB, whereas at 12 dB, it drops below 0.5 dB). 
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Fig. 6. Achievable closed loop rates versus correlation between successive 
channel responses, MIMO system, SNR = 4 dB, coherence time K = 40, 
Rayleigh channel. Solid line corresponds to a system with the channel response 
estimation and dashed line to the case of the ideal channel response available 
at the transmitter and the receiver (but with the temporal mismatched H. ; _! ^ 



Further, we note that in the case of 2 x 4 and 1x4 systems, 
the gain practically disappears. This is explained as an effect of 
multiple receive antennas (greater than the number of transmit 
antennas), providing an already sufficient degree of diversity, 
eliminating any need for transmitter optimization. Instead of the 
ergodic capacities, when observing the cumulative distribution 
function (cdf) of the capacity, the difference is more pronounced 
[see Fig. 5(b) for = 4 dB] (see more on the "capacity versus 
outage" approach in [5]). 

From the results in Fig. 6, we observe how the temporal 
mismatch between successive channel responses (Hi-i ^ H^) 
affects the achievable rates R wp in (31). As said earlier, 
the temporal mismatch is characterized by the correla- 
tion E [h {i _ l)nm h* inm \ /r = k (for m = l,...,M and 
n = 1, . . . , N). We observe the cases when the ideal channel 
response (dashed lines) and channel response estimates (solid 
lines) are available at the transmitter and the receiver. Solid 
lines correspond to the channel response estimation where the 
pilot power is selected to maximize the achievable rate R wp . 
We observe the rates for the Rayleigh channel: SNR = 4 dB, 
and the coherence time length K = 40. Note that for k = 0 
(i.e., when the successive channel responses are uncorrelated), 
the achievable rate is lower than in the case of k — 1 (i.e., 
when the successive channel responses are fully correlated). 
The drop in the achievable rates is not substantial, even though 
the water pouring algorithm is fully mismatched for k = 0. We 
explain this behavior in the following. In the case of a Rayleigh 
channel, the matrix V^-iSi usually has M degrees of freedom 
and a small condition number of the corresponding covariance 
matrix. Consequently, even in the mismatched case, multiplying 
Hi with Vi_iSi preserves the degrees of freedom of the matrix 
Hi, resulting in a high capacity of the composite channel G 
in (29). We expect the detrimental effects of the mismatch to 
be amplified in the case of Rician channels, especially those 
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Fig. 7. Achievable closed- and open-loop rates versus correlation between 
successive channel responses, MIMO system 4x4, SNR= 4 dB, coherence 
time K = 40, Rayleigh channel. Solid line corresponds to a system with the 
channel response estimation and dashed line to the case of the ideal channel 
response available at the transmitter and the receiver (but with the temporal 
mismatched H;_! ^ H ; ). 



with large K-factor. This is because Rician channels result in 
the matrix Vi_iSi having a few dominant degrees of freedom, 
thereby making accurate feedback beneficial. 

In Fig. 7, we compare the open-loop scheme with the 
closed-loop scheme under temporal mismatch. It is observed 
that when the channel coherence is low (i.e., low correlation 
k), it is better to not use a closed-loop scheme. In the observed 
case (4 x 4, SNR = 4 dB and coherence time K = 40), for 
the correlation coefficient k < 0.7, the achievable rates for the 
closed-loop scheme are lower than in the open-loop case. 

VII. Conclusion 

In this paper, we have studied how the estimation error of the 
frequency-flat time-varying channel response affects the per- 
formance of a MIMO communication system. Using a block- 
fading channel model, we have connected results of Inform. 
Theory with practical pilot estimation for such systems. The 
presented analysis may be viewed as a study of mismatched re- 
ceiver and transmitter algorithms in MIMO systems. We have 
considered two pilot-based schemes for the estimation. The first 
scheme uses a single pilot symbol per block with different power 
than the data symbol power. The second scheme uses more than 
one pilot symbol per block, whose power is the same as the 
data symbol power. We have presented how the achievable data 
rates depend on the percentage of the total power allocated to 
the pilot, background noise level, and the channel coherence 
time length. Our results have shown that the first pilot-based 
approach is less sensitive to the fraction of power allocated to 
the pilot. Furthermore, we have observed that as the number of 
transmit antennas increases, the sensitivity to the channel re- 
sponse estimation error is more pronounced (while keeping the 
same number of receive antennas). The effects of the estimation 
error are evaluated in the case of the estimates being available 
at the receiver only (open loop), as well as in the case when the 
estimates are fed back to the transmitter (closed loop), allowing 



water pouring transmitter optimization. In the case of the water 
pouring transmitter optimization and corresponding rates, we 
have not observed significant gains versus the open-loop rates 
for the channel models considered here. Further, we observe that 
in certain cases, it is better to use the open-loop scheme as op- 
posed to the closed-loop scheme. The analysis presented here 
can be used to optimally allocate pilot power for various system 
and channel operating conditions and to determine the effective- 
ness of closed-loop feedback. 
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